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We claim: 




V^l 1. A method for maintaining a data warehouse, comprising: 
identifying a data source of int€ rest; 
updating a metadata to reflect h \formation available from said source; 
automatically generating a med ator based on said metadata; and 
writing a wrapper for said sour :e which calls said mediator. 

2. The method of claim 1, whei ein the step of updating a metadata 
comprises entering new types of informatioi , new data formats for previously defined 
information, new transformations between data formats, and the schema of said source 



3. The method of claim 1, wlrerein said mediator is fully functional and is 
automatically generated by a stand-alone mediator generation program. 



4. The method of claim 3, w 



automatically defines an API and traru 



herein said mediator generation program 
ation libraries 



5. The method of claim 4, w lerein said mediator comprises code to 
translate between source and target re] >resentations, possibly using externally 
defined methods, and load data into said warehouse. 
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6. The method of claim 1, wherein said vyrapper makes use of said 

mediator. 

7. The method of claim 3, wherein s&id mediator generation program 
defines a public data representation, wherein said wrapper uses said public data 
representation. 

8. The method of claim 3, ^herein said wrapper uses said mediator to 
load data into said warehouse. 

9. A DataFoundry metadata model comprising abstractions, translations, 
mappings and database descriptions. 

10. The mofael of claim 9, comprising a UML DataFoundry metadata 
representation. 

ll/The model of claim 9, wherein said model defines metadata used by a 
mediator generation program, wherein said mediator generation program generates a 
mediator 



v 

fr( ^ e met ^ oc ^ °f c l a i m 11/ wherein said mediator generation program 
comprises: 
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reading said metadata; 
generating translation libraries, 
generating an API; and 
generating said mediator. 



13. The method of claim 12, wherein the step of reading said metadata 
comprises reading the abstraction metadata; reading the translation metadata; 
reading the database description metadata; and reading the mapping metadata. 

14. The method of cl&m 12, wherein the step of generating translation 
libraries comprises developing public and private class definitions and 
implementations of data structuresX 

15. The method of claim 14, wherein said data structures comprise said 
abstractions and said translations. \ 



16. The method of claim 12, wherein generating the mediator consists of 



creating public and private definitions and implementations of a class or classes 
capable of receiving data in one format, converting it to another format, and loading 
it into a data warehouse. 



• ... • 



lv. The method of claim 16, wherein said data is received by a receiving 
data structure defined within said translation library and said data is loaded into a 
warehouse whose schema corresponds to the database description component of the 
metadata. 

Pis applied to data 
:e and structure analysis, 
is applied to data 
maics and proteomics. 
is used for integrating a 
is used for updating a 
ed. 
DataFoundry 



23. The method of claim 14, wherein said data structures correspond to 
said abstractions and said translations. 
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24. A computer-useable medium embodying computer program code for maintaining a 
data warehouse by executing the steps of: 

identifying a data source of interest; 

updating a metadata to reflect information available from said source; 
automatically generating a mediator based on said metadata; and 
writing a wrapper for said source/which calls said mediator. 

25. The computer-useable medium of claim 24, wherein the step of 
updating a metadata comprises entering nefw types of information, new data formats 
for previously defined information, new transformations between data formats, and the 
schema of said source. 



26. The computer-useable medium of claim 24, wherein said mediator is 
fully functional and is automatically generated by a stand-alone mediator generation 
program. 



27. The computer-useable medium of claim 24, wherein said mediator 
generation program automatically defines an API and translation libraries 
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28. The computer-useable medium of claim 27, wherein said mediator 
comprises code to translate between source and targettfepresentations, possibly 
using externally defined methods, and load data inta said warehouse. 

29. The computer-useable medium ofyclaim 24, wherein said wrapper 
makes use of said mediator. 

30. The computer-useable mediyun of claim 26, wherein said mediator 
generation program defines a public dat^i representation, wherein said wrapper uses 
said public data representation. 

31. The computer-useable medium of claim 26, wherein said wrapper 
uses said mediator to load data into said warehouse. 



32. A DataFoundfy metadata model comprising abstractions, translations, 
mappings and databaseraestriptions. 



33. The 
representation. / 



Lodel of claim 32, comprising a UML DataFoundry metadata 
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34. The model of claim 32, wherein said model defines metadata used by 
a mediator generation program, wherein said mediator generation program 
generates a mediator. 



35. The model of claim 34, wherein said mediator generation program 
comprises a computer-useable medium embodyiirtg computer program code for 
maintaining a data warehouse by executing thp steps of: 

reading said metadata; 

generating translation librarie 

generating an API; and 

generating said mediator. 



36. The model of claim 35, wherein the step of reading said metadata 
comprises reading the abstraction metadata; reading the translation metadata; 
reading the database descrimion metadata; and reading the mapping metadata. 



37. The modjfl of claim 35, wherein the step of generating translation 
libraries comprises developing public and private class definitions and 
implementations of data structures. 



abstractions anl 



Le model of claim 37, wherein said data structures comprise said 
¥aid translations. 



The model of claim 35, wherein generating the mediator consists of 
creating public and private definitions and implementations of a class or classes 
capable of receiving data in one format, converting it to another format, and loading 
it into a data warehouse. 



The model of claim 38, wherein said data is received by a receiving 
data structure defined within said translation library and said data is loaded into a 
warehouse whose schema corresponds to the database description component of the 
metadata. 



%tr. The model of claim 24, wherein said method is applied to data 
warehousing applications in the domain of protein sequence and structure analysis. 



^ The model of claim 24, wherein said method is applied to data 
warehousing applications in the domain of functional genomics and proteomics. 

^f2. The model of claim 24, wherein said method is used for integrating a 
new data source into a data warehouse. 

A 

The model of clairh 24, wherein said method is used for updating a 



warehouse when a previously integrated data source is modified. 



^4. The model of claim 32, as defined by the UML DataFoundry 

\ 



epresentationl 



4p. The model of claim 37, wherein said data structures correspond to 
said abstractions and said translations. 

46. An apparatus tor maintaining a data warehouse, comprising: 
means foAidentifying a data source of interest; 

means for updating a metadata to reflect information available from said 

source; 

means for automatically generating a mediator based on said metadata; 

and 

means for writing a wrapper for said source which calls said mediator. 

The method of claim 1, wherein said method is applied to data 
warehousing applications in the yiomain of astrophysics and climate modeling. 

^f8. The method of claim 1 wherein said method is applied to data 
warehousing applications in the domain of medical image processing and analysis. 



^ 0. The method of claim 1, wherein said method is applied to data 
warehousing applications in the domain of tracking consumer and customer 
preferences. 

A 

f. The method of claim 1, wherein said method is applied to data 
warehousingyapplications in the domain of satellite and terrestial communication 
systems analysis. 

method of claim 1, wherein said method is used for integrating a 
new data source iijto a data warehouse. 

yz. The method of claim 1, wherein said method is used for updating a 
warehouse when a previously integrated data source is modified. 



